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Abstract 

' The paper estimates the Chernoff rate for the efficiency of quantum 

, hypothesis testing. For both joint and separable measurements, approx- 

imate bounds for the rate are given if both states are mixed and exact 
| expressions are derived if at least one of the states is pure. The efficiency 

, of tests with separable measurements is found to be close to the efficiency 

of tests with joint measurements. The results are illustrated by a test of 
quantum entanglement. 

1 Introduction 

£^ ' Mark Kac once called Probability Theory measure theory with a "soul" pro- 

. vided by Physics, games of chance, Economics or Geometry. 1 In a sense then, 

' Quantum Statistics can be called probability theory with a "subconscious" . The 

(*C) . probability distributions, so important for classical statistics, are no longer the 

deepest layer of foundations but only an outward manifestation of geometry in 
. the Hilbert spaces of quantum states. This foundational change begs for a new 

look at the classical statistics results, and this paper contributes by reconsider- 
ing the Chernoff-Hocffding results about hypothesis testing. 

Why quantum statistics? Today, quantum states can be manufactured. For 
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example, in one method (Cirac and Zoller (1995) I ions are placed in a trap 



created by electrostatic potential and radio-frequency oscillations. The ions 
then are cooled by laser emission, and arranged on a line in the trap. After that, 
each individual ion can be accessed by laser pulses and their joint quantum state 
can be altered according to the researcher's wishes. This ability to built and 
manipulate quantum systems is changing our thinking about computation and 
information transmission. Suddenly, certain classic problems - the factorization 
of large integers, the search in an unstructured database, secure communication 
- are not as difficult as they always were. 

This conceptual change also affects statistics. 

For example, how can a quantum state manufacturer check if states have 
been generated faithfully? We can anticipate the statistician's answer: Select a 

* Cornerstone Research, 599 Lexington Avenue, New Y ork, NY 10022, U SA; slava@bu.edu 
1 In a preface to a book about geometric probability by |Santalo (1976J1 
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sample of the states and perform a statistical test. But now, besides designing 
the test, the statistician must play an additional role, the role of advisor on how 
to perform measurements of a sample of quantum states. Since in quantum 
mechanics both the measurement and the state determine the probability dis- 
tribution of outcomes, the choice of measurement affects the properties of the 
statistical test. 

Not all measurements are readily available. Sometimes it is possible to mea- 
sure sample states jointly, as one large quantum state, and sometimes the states 
can only be measured separately and simultaneously. Yet another possibility is 
that the states must be measured separately and sequentially. Finally, some- 
times the sample states can only be measured partially, for example, when 
each state represents several remote particles that cannot be measured jointly. 
Clearly, the efficiency of the optimal test will depend on which measurements are 
available. In this paper we will concentrate on joint and separable independent 
measurements . 

For a single state the problem of quantum hypothesis testing was solved 



by Holevo (1976) and Helstrom (1976) In this paper, I consider a different 



situation: when the researcher has access to several copies of the same state but 
may not be able to measure them jointly. 

The problem of testing using a sample of states was also considered in 



Helstrom (1976) (see also recent results by Ogawa and Nagaoka (2000) and Parthasarathy (2001) I. 



These authors considered only joint measurements and only the situation when 
one of the errors may go to zero arbitrarily slowly. In contrast, I consider a 
Bayesian version of the problem, in which the researcher aims to minimize a 
weighted average of both errors, and I consider both joint and separable mea- 
surements. 

When joint measurements are available, the problem of testing using a sam- 
ple can be solved by applying the Holevo-Helstrom result to the case of tensor 
powers of primary states. In this case, my main results provide useful bounds 
on both the expected error when the sample is finite and the rate of decline in 
error as the number of sample states grows. The bounds are given in terms of 
fidelity distance between quantum hypothesis. 

In addition I derive explicit expression for the rate of error decline in cases 
if either one of the hypothesis specifies pure state, or the states specified by the 
hypotheses commute. 

For the separable measurements, I mainly concentrate on the asymptotic 
case. If at least one of the hypotheses is pure, then the optimal separable 
measurement has the same asymptotic error rate as the joint measurement. If 
both hypotheses are mixed, then there is a measurement whose performance 
is close to the performance of the joint optimal measurement. Together these 
results imply that the loss in efficiency associated with restriction on available 
measurements is not large. 

This paper contributes only to the theory of quantum hypothesis testing. I 
do not touch on another rapidly growing area of research: quantum state estima- 
tion. For recent progress in this area see the review article by |Barndorff-Nielsen, Gill, and Jupp| 

The rest of the paper is organized as follows. Section 2 gives some basic in- 
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formation about quantum states and measurements and formulates the problem 
of quantum hypothesis testing. Section 3 gives a short summary of the Chernoff- 
Hocffding results about hypothesis testing. Sections 4 and 5 discuss joint and 
separable measurements, respectively. Section 6 presents an illustration. And 
Section 7 concludes. 

2 Quantum Hypothesis Testing 

States of quantum-mechanical objects - electrons, photons, atoms, molecules, 
etc. - are described by density matrices. A density matrix is a self-adjoint, 
non- negative operator of a complex Hilbert space with a trace of 1. In this 
paper we will be concerned only with finite-dimensional Hilbert spaces, so the 
operator is indeed represented by a finite Hermitian matrix. A particular case 
is matrices of rank one. They are projectors on one-dimensional subspaces and 
called pure states. 

States are not directly observable: they can be measured but the outcome of 
a measurement is a random variable. More precisely, measurements are sets of 
non-negative operators which are required to add up to the identity operator. 
Each operator corresponds to a particular outcome of the measurement, and 
the probability of outcome i if the state is p and the measurement is {Mi} 
is tr {Mip} . An important subclass is formed by measurements in which the 
outcomes are orthogonal projectors: MiMj — SijM i: where is the Dirac 
delta-function. 

We consider the following problem: a researcher is given a sample of N 
identical quantum states, which are either p or p 1 with the prior probability 
1/2. He aims to minimize the average probability of making an incorrect decision 
about the state by devising a system of measurements and a decision rule. Can 
we safely assume that all measurements are available to the researcher? No. 

While in some situations the researcher can make a joint measurement of 
the state that represent the total sample, most often he can do only separate 
measurements of each state in the sample. If the measurements are done inde- 
pendently of each other, then we will call them separable independent measure- 
ments. If the measurements can be done sequentially and the researcher adjusts 
the current measurement according to the results obtained in the previous mea- 
surements, then they are separable adaptable measurements. 

Sometimes, the researcher is even more restricted. This happens, for exam- 
ple, if a sample quantum state consists of two spatially remote parts and the 
researcher can only measure them separately. Mathematically it means that the 
operators of the measurement must be block-diagonal in a certain basis. This 
setup may raise interesting statistical issues about identification of the state 
properties. 
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3 Classical Chernoff-Hoeffding Bounds 



This section reviews results by |Chernoff (1952)l|Sanov (1957)] amd |Hoeffdfng ( 1965) 



about asymptotic error rates in hypothesis testing. For details the reader can 
also consult the book bv I Cover and Thomasl 

Consider two multinomial distributions, P and Q, from one of which a sample 
is drawn and provided to a researcher. The researcher's task is to guess the 
distribution. The sufficient statistic for this problem is the empirical distribution 
of the sample, X, and the decision rule is specified by two complementary sets, 
V and Q, of probability distributions on outcomes. If X G V, hypothesis P is 
accepted; otherwise, Q is accepted. It is assumed that P £ V, and Q e Q. 

If the true probability distribution is P, it is the Sanov theorem that asymp- 
totically the probability of making an error and accepting Q is 

exp[-ND(Q\\P)}, (3.1) 

up to a subexponential factor, where D(Q\\P) is the Kullback-Leibler distance 
from P to Q: 



D(Q\\P) =mmVsiln^. (3.2) 

i—l 

It follows that the average probability of making an error declines asymptotically 
with growth in N: 

R ~ exp [-N min {D(Q\ \P), D(V\\Q)}} . (3.3) 

The maximum of the decline rate over all possible V and Q is sometimes called 
the Chernoff information distance between distributions P and Q : 

D C (P,Q) = maxmm{D(Q\\P),D(V\\Q)} (3.4) 

Hocffding proved that the optimal sets V and Q can be determined from 
the maximum likelihood principle: a distribution S belongs to V if and only 
if D(S\\P) < D(S\\Q). Intuitively, in this case distribution S is more likely to 
be observed if the true distribution is P rather than if it is Q, so hypothesis P 
should be accepted. 

For example, for multinomial distribution we have the following formula for 
the probability of error in the optimal test: 



R = \\ E pT-p x n+ E \ (3-5) 

Pi 1 ■■■Pl n «li 1 ■■■Ql" Pi 1 ■■■P^ n ><h 1 I 

X! + ...+x n =N a;i + ...+a: 7l =Af ) 

, ,1-^ E W-P X n-^-€"\\ (3-6) 
I xi + ...+x n =N ) 

' f l-i||P JV -Q iv || 1 } (3.7) 
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2 I 2 

cexp [-ND e (P, Q)] , (3.8) 
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where the sums are taken over possible results of sampling TV times from a 
multinomial distribution with n outcomes: x\ is the number of outcomes of 
type 1, X2 is the number of outcomes of type 2, and so on; Pjy and Qm are 
distributions on the sample space induced by the distributions P and Q on 
outcomes, and || - 1| -|_ is the total variation norm. 



It also turns out (see Cover and Thomas (1991) for derivation) that for the 



optimal choice of V and Q, the probability distributions S e Q and S' 6 P 
that minimize respectively D(S\\P) and D(S'\\Q) are the same and given by 
the following formula: 

P ^ (3.9) 



EN \ 1- 



where A is chosen in such a way that D(S\\P) — D(S\\Q). Knowing expression 
<|3.9|l we can derive another expression for the asymptotic probability of error: 



I N 

-lni?= min logV^V 1 "*. (3.10) 
N o<a<i b Z_^ 1 ^ ^ > 



All these derivations presuppose that P and Q are fixed. In quantum statis- 
tics the researcher has the ability to vary P and Q by choosing the measurement. 
How does this change the classical results? 



4 Joint Measurements 

In this section we look at the joint measurements of a sample of quantum states. 
The minimal expected error obtained in this case is a lower bound on the error 
achievable when the set of measurements is restricted. In addition, the theory 
for joint measurements provides a fascinating counterpart to the classical theory 
of the Chcrnoff bounds. 



4.1 Generalities 

Joint measurement of all sample states is by definition a measurement of the 
tensor product of the sample states. Thus, in effect we have the problem of 
testing two alternative hypotheses about a single - although huge - quantum 
state, the problem that was solved by Holevo and Helstrom (see, for example, 
Holevo (2001) I. In our situation we only need to determine what additional 



implications follow from the special structure of the state. 

If the hypotheses about the quantum state are given by matrices p and p 1 
with prior probability of 1/2, then according to the Holevo-Helstrom result, the 
optimal measurement is an orthogonal measurement with d outcomes, where d 
is the dimension of the Hilbert space and the outcomes are projectors on the 
eigenvectors of operator p Q — p x . The decision is made based on the following 
rule: If the measurement outcome corresponds to an eigenvector with a positive 
eigenvalue, then p is chosen; otherwise, p x is chosen. 
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The minimal expected error probability that can be achieved after the opti- 
mal measurement is given by the following formula: 



fl=^(l-^HPo-Pilli), (4- 1 ) 

where denotes the sum of the absolute values of eigenvalues. 

In our case the hypothetical states are tensor powers of the individual states, 
pf N and pf N , where 

P? N = Pi® Pi®-® Pi- (4.2) 

" V ' 

JV 

The number of outcomes in the optimal joint measurement is d , so it can be 
enormous for large values of N. The error is 

R=l(i-l\\pr-p? N \\ 1 )- (4.3) 

Note the similarity with classical expression l|H.7|l . 

What is the asymptotic rate of decline in error? Can we explicitly calculate 
the distribution of eigenvalues of pf N — pf N 7 

Initial moments of this distribution are indeed easy to calculate. Let us 
introduce a notation for the moments: 



P n =■ f Q t n dF{t) = ±tr (pf N - pf N y , (4.4) 

where F(t) is the discrete probability distribution that puts equal probability 
weight on each eigenvalue. Then the following Proposition holds 

Proposition 4.1 

Pn = ^ E (-lf k Htr(p kl -P kn )) N , (4.5) 
{fci, ...fc„} 

where {k\, ...k n } run over the set of all n— sequences ofO and 1. 

Proof: The proposition follows from the non-commutative binomial expan- 

sion of (pr-pf N T ^d the fact that tr{p^...p^) = (tr (p kl ...p k J) N . 
QED 

The advantage of this formula is that for a fixed n, the calculation is as easy 
for large as for small values of N. The difficulty is that the number of terms in 
this formula grows exponentially with n. Therefore the standard map from the 
set of moment sequences to the set of distributions is impractical. In the next 
sections we will pursue a different approach to estimation of \\p® N — pf N \\i ■ 
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4.2 Special Cases 

To get more insight about the behavior of 1 1 pf N — pf N 1 1 , it is useful to consider 
several special cases: (1) when both states are pure; and (2) when the density 
operators commute. In the first case let p Q = \ip ) (ip \ and p x = j^-J (tp^ . 
Then we have the following result: 



Theorem 4.2 If both states are pure, then the average error probability is 
R = 



1 



Asymptotically, 

Proof : Because of (14.111 , we need only to prove that for pure states 



r-p? N \\ 1 =2\/i-mi>i)\ 2N - 



We can write 



J8NU 
Pi 111 



(4.6) 



(4.7) 



(4.8) 



(4.9) 



Operator 



acts nontrivially only in a 2— dimensional 



space spanned by ipQ ly and ipf N , and it is easy to compute the operator eigen- 
values in this space. They are 



±\l^-\(^r |^r)| 2 =± v i-i(^i^)r. (4.10) 



From this and the fact that all other eigenvalues are zero, the first equality of 
the theorem follows. The asymptotic expression follows from the Taylor series 
for the square root. 
QED. 

Now consider the case of commuting p and p l . Let the distributions of 
eigenvalues be P for p and Q for p l . 

Theorem 4.3 If states commute, the average error probability is asymptotically 

i?~ cexp[-ND c (P,Q)} (4.11) 

In other words, the probability of error has exactly the same growth rate as 
in the classical case. 



2 For convenience, we use the Dirac ket-bra notation: the elements of the Hilbert space 
are denoted as \ip) , and the linear functionals on the Hilbert space are denoted as {tp\ . In 
particular, [i/'q) {ip \ is the orthogonal projector on \ipo) ■ 
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Proof: Since the density operators p and p 1 commute, we can choose the 
basis in which they both are diagonal. In this basis 

H N -pf N \\i= E IpN-j#-9N-«H ( 4 - 12 ) 

k=(kx,...,kd) 

where k is a partition of N, and (pi,...,Pd) an d {qi, •••,%) are eigenvalues of p 
and p 1: respectively. On the right-hand side we have ||P/v — Q.iv |li> the distance 
between two multinomial distributions, P/v and Qn, that arise in repeated trials 
from distributions P and Q. Therefore, because R = \ (l — \ \\pq N — pf N \\i) 
in the quantum case and R = ^ (l — | ||P/v — Qwlli) m the classical case, the 
average errors and their asymptotic growth rates are the same in the quantum 
and classical cases. 
QED. 



4.3 Bounds 

Let us now derive some simple bounds on the error probability that follows from 
known inequalities. These bounds are useful because they are rather narrow 
and easy to compute. The first set of bounds follows from inequalities between 
quantum fidelity and probability of error. The second bound is only applicable 
to the asymptotic rate of error decline, and it follows from a quantum analog of 
Stein's lemma. 

Recall that fidelity between two states is defined as follows: 

F (Pa,Pi) = tr \J 'VPoPiVPoi ( 4 - 13 ) 

where \fx is the unique non-negative definite, Hermitian matrix Y such that 
Y 2 =X. 

Theorem 4.4 Probability of error for optimal test with joint measurement sat- 
isfies the following bounds: 



\(l-^l-[F (p , Pi)f N ) <R<\[F (p ,Px)] N ■ (4.14) 



Asymptotically, 



2 log F (p , Pl ) < 1 log R < log F(p Q , Pl ) (4.15) 



If p * s pure, p = IV'o) (V'ol > ^ e probability of error satisfies a tighter upper 
bound: 

R<hF(p , Pl )] 2N = UiPo\Pi\iPo) N - (4-16) 



Proof: The first result follows from inequality (44) in Fuchs and van de Graaf (1999) 



applied to the case of the sample of N independent states, and from the fact 



8 



that F(p® N ,pf 
cise 9.21 in 



[F(p , p 1 )] JV . The second one is a consequence of Exer- 



Nielsen and Chuang (2000) For the reader's convenience, I include 



below short proofs of these results. 

The Fuchs-Graaf result states that for every pair of quantum states, p and 
p 1 , it is true that 



l-F(p , Pl ) < h\ Po - Pl \\i < y/l-F( Po , Pl f 



(4.17) 



These inequalities follow because 

(1) F (p , Pl ) — min^Q F(P, Q), where distributions P and Q arise from a 
measurement of states p and p 1 , and where F (P, Q) —: £\ \/PiQi] 

(2) ||p — px\\i = maxpg ||P — Q\\i, where P and Q come from a measure- 
ment, and where \\P - Q\\i =: J2i \Pi ~ 

(3) the corresponding inequality holds for probability distributions 



l-F(P,Q)<^\\P-Q\\ 1 <^l-F(P,Qf. 



(4.18) 



Indeed, given (1), (2), and (3), the left-hand inequality in (|4.17|l follows because 
1 - F (Po.Pi) = 1 - F (P, Q) (for certain P and Q) (4.19) 



< \\\p-Q\\i < Jlk-Pilli- 

(3) 2 (2) 2 

The right-hand inequality follows similarly. 



(4.20) 



Result (1) is from Fuchs and Caves (1995) (2) is a restatement of the 
Holevo-Helstrom result l^^- The left-hand inequality in (3) holds because 



^2 \Pi -qi\ WpI - V* 



) 2 > 2 ^i - y/pwA 



The right-hand inequality in (3) holds because 

2 



(4.21) 

(4.22) 
(4.23) 

(4.24) 



To prove the second part of the theorem, we need to prove that if p 
\ip ) (ip \ , then there is such a measurement and a decision rule that 



R< 2 W-olPilV-o) 



(4.25) 
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Take measurement {P^ ,1 — Pif> }, where P^ o is the projector on vector ip Q . 
Then the probabilities of the first and second outcomes are respectively 1 and 
if the state is ip , and (tp \ p 1 \ip ) and 1 — (tp \ p 1 |V>o) if the state is p x . Define 
the decision rule as follows: state ip is accepted if and only if the first outcome 
occurs. The expected error of this rule is \ (i{j \ p 1 \tp Q ) . 
In the case of tensor powers l|4.25|) becomes 

R<l&o\Pi\i>o) N - (4-26) 

QED. 

Note that the lower bound of inequality H4.14JI binds for pure states. This 
can be seen from l|4.6l) because for pure states F(p , p x ) — \ (ip \ | . The upper 
bound of inequality (|4.14() binds for certain commuting operators. 

Another bound follows from results by Ogawa and Nagaoka (2000) Define 
quantum relative entropy: 

D(Po\\Pi) = tr [p (logp - log pi)] . (4.27) 
Then the following lower bound on the error rate holds. 

Theorem 4.5 logmax{D(p ||Pi),-D(Pil|Po)} £ ir lo S R 

Proof: R is the average of error probabilities of two types. Say, R = jR\ + 
ii? 2 - If both R 1 and i? 2 satisfy the inequality, then R also does. |Ogawa anJ Nagaoka| 
proved that if one of the error probabilities violates this inequality, then the 
other error probability must approach one as the sample size grows, so the 
inequality will hold for the average of the error probabilities, R. 

QED. 

For an example of two-dimensional states, the bounds are illustrated in Fig- 
ures 1,2,3 and 4. The states in the example are linear combinations of the Pauli 
matrices: 

Po = \{I + a*i), (4.28) 
Pl = I (I + {b cos 6)a x + (b sin 6)a 2 ) , (4.29) 



where 





' 






1-1 




i 


> 0-2 = ( 



_°. ; ) 



Figures 1 and 2 suggests that bound from Theorem 14.51 is a good estimate of 
the error if the sample size is small and underestimates the error if the sample 
size is large. Figures 3 and 4 suggest that bound from Theorem 14. 51 is especially 
good when the hypotheses are close to completely mixed state, \l. 
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5 Separable Measurements 



In the previous section we have seen that it is difficult to compute the optimal 
joint measurement because of the high dimensionality of the problem involved. 
Besides, even if the optimal joint measurement is found, it can have an enormous 
number of outcomes, so it is hard to realize it in the laboratory. In this section we 
turn our attention to separable independent measurements. The goal is to show 
that the efficiency of a separable measurement with a small number of outcomes 
is not much smaller than the efficiency of the optimal joint measurement. 

Let us denote the probabilities of the i — th outcome as pi and qi depending 
on whether the state is p or p 1 . The following theorem about optimal measure- 
ments holds: 

Theorem 5.1 All outcomes of an optimal measurement are projectors. 

Proof: Indeed, if the measurement includes an outcome, Mo, that is not a 
projector then it can be represented as a sum of projectors with non-negative 
coefficients: 

n 

M = ^aiMi. (5.1) 

i=l 

Therefore 

n 

p Q = :tr(M p ) =^2ctiPi, (5.2) 
i=i 

n 

q = : tr(Mop x ) = } j a i q i . (5.3) 

i=l 

Since function x x y 1 ~ x is concave and homogeneous, we have the following 
inequality 

n 
i=l 

Because of l|3.1UI) , this inequality implies that we can decrease the error by using 
the set of outcomes {M 4 } instead of Mo. This contradicts the optimality of the 
measurement. 
QED. 

How many outcomes does an optimal measurement have? It turns out that 
if one of the states is pure, p = \ip ) (ip \ , then only two outcomes is needed 
- a huge reduction relative to the d N outcomes needed for the optimal joint 
measurement. 

Theorem 5.2 When one of the states is pure, there is an asymptotically opti- 
mal test with two outcomes in each measurement. The average error probability 
of the test satisfies the following bound 

R<1^o\Pi\A) N asN^^. (5.5) 
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Proof: Take measurement {P^ , I — P/i }, where P^ is the projector on 
vector ip . Then the probabilities of the first and second outcomes are respec- 
tively 1 and if the state is p , and (ip \ p x \tp ) and 1 — (ip \ p 1 \ip ) if the state 
is p l . Define the decision rule as follows: state p is accepted if and only if 
the second outcome never occurred. This rule leads to an error if and only if 
the true state is p 1 and the second outcome never occurs. Thus the average 
probability of error for this decision rule is 

R=\{%\Px\%) N ■ (5-6) 

Thus the rates of error decline coincide for the cases of joint and separable 
measurements. Since the optimal separable test cannot do better than the 
optimal joint measurement, the measurement considered is optimal. 

QED. 

If both states are mixed, then we can use the measurement that maximizes 
fidelity distance between distributions of outcomes. In other words, the mea- 
surement is chosen in such a way that it minimizes 

F{P, Q) = J2 Vp^- ( 5 - 7 ) 

We will call this measurement fidelity-optimal. The advantage of this method 
is that the fidelity-optimal measurement is easy to compute. It is simply a 
measurement with outcomes that are orthogonal projectors on the eigenvectors 
of the following operator: 



Ti-r -1/2 / 1/2 1/2 -1/2 /v q n 

M = Pl V/V p oPl ' Pl 1 . (5.8) 



(See Fuchs and Caves (1995) for an explanation why this M is fidelity-optimal.) 
This measurement has only d outcomes and their probabilities are easy to com- 
pute. For this fidelity-optimal measurement wc can write a bound on the asymp- 
totic error: 

Theorem 5.3 The asymptotic error of the test based on the fidelity- optimal 
measurement has the following asymptotic bound: 

^lo gj R<logP(p , Pl ). (5.9) 

This is the same upper bound that we have for joint asymptotic measurement 
according to Theorem 14.41 
Proof: 

j N N 

N lnR = o<t<i log ^2 P * q *~ X - log E^« <logF(p ,Pi)- (5-10) 

- - i=l i=l 

The equality holds because of 1)3. 10[) . and the second inequality is inequality 



(44) in Fuchs and van de Graaf (1999)| 



QED. 
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6 Illustration 



This section illustrates the concepts developed above with an example of testing 
for the presence of entanglement. Entanglement is one of the properties of 
quantum systems that clearly separates them from classical systems. It is a 
co-dependence of two remote parts of a quantum system that cannot be created 
or destroyed by local operations on the parts. Entanglement has become an 
important part of many quantum technologies including quantum teleportation 
and quantum cryptography. 

Entanglement has been produced in the laboratory. For example, Turchette et al. (1998) 
developed a technique in which two ions are trapped and illuminated equally 
by a laser beam that results in the creation of entanglement. 

In this illustration we are interested in tests of whether the entanglement 
has been produced or not. 

An example of an entangled quantum state is a pure state of the system of 
two particles that corresponds to the projector on the following vector: 



1 

71 



(|00> + |11» 



(6.1) 



where |00) and |11) denote |0) ® |0) and |1) ® |1) , and |0) and |1) form an 
orthonormal basis in the Hilbert space corresponding to one of the particles. 
The density matrix for this system is 



Pa = \%) (%\ 



f 1 * 








1 

2 
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1 
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) 



(6.2) 



The alternative hypothesis is that the state is a mix of two non-entangled states 



given by projectors on vectors 1 00) and |11) 
for this hypothesis is 



Pi 














\ 






























V o 








1 

2 





respectively. The density matrix 



(6.3) 



This state can be easily produced by local operations but it is useless for tech- 
nologies that require entanglement. 

Applying Theorem 15.21 we obtain the following formula for the asymptotic 
error 



R 



1 



(^oIpi IV>o! 



N 



1 



(6.4) 



2 ITUiniru/ 2 N+1 ' 

It follows that it is sufficient to measure a sample of size 3 to reduce error below 
5%. 

The components of the optimal separable measurement are the projection on 
ip and its complement. Note that this is a joint measurement of both particles. 
Actually, p and p 1 cannot be distinguished by the measurements that operate 
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on each particle separately. This problem is statistically unidentified by local 
measurements. 

7 Conclusion 

We have estimated the Chernoff efficiency bound for cases of joint and separable 
measurements and also calculated it exactly for both pure and commuting states. 
The results suggest that the loss of efficiency caused by restriction to separable 
measurements is small. 

Several questions remain open. Notably, it is not known whether the joint 
measurement can ever be asymptotically better than the optimal separable mea- 
surement. Second, it is not clear whether the optimal separable measurement 
consists of orthogonal projectors. Third, it is unclear whether the number of 
outcomes in this measurement is finite for finite-dimensional quantum states. 

References 

Barndorff-Niclscn, O. E., R. D. Gill, and P. E. Jupp (2001). 
On quantum statistical inference. Working Paper, available at 
http://www.math.uu.nl/people/gill/Preprints/qiread9statsoc.pdf. 

Chernoff, H. (1952). A measure of asymptotic efficiency for tests of a hypoth- 
esis based on the sum of observations. Annals of Mathematical Statis- 
tics 23(A), 493-507. 

Cirac, J. I. and P. Zoller (1995). Quantum computation with cold trapped 
ions. Physical Review Letters 74, 4091-4094. 

Cover, T. M. and J. A. Thomas (1991). Elements of Information Theory. 
John Wiley and Sons, Inc. 

Fuchs, C. A. and C. A. Caves (1995). Mathematical techniques for quantum 
communication theory. Open Systems and Information Dynamics 5(3), 
345-356. also available at http://arxiv.org/abs/quant-ph/9604001. 

Fuchs, C. A. and J. van de Graaf (1999). Cryptographic distinguishability 
measures for quantum-mechanical states. IEEE Transactions on Infor- 
mation Theory 45(4), 1216-1227. 

Hclstrom, C. W. (1976). Quantum Detection and Estimation Theory. Aca- 
demic Press: New York. 

Hocffding, W. (1965). Asymptotically optimal tests for multinomial distribu- 
tions. Annals of Mathematical Statistics 36(2), 369-401. 

Holevo, A. S. (1976). Investigagion of a general theory of statistical decisions. 
English translation: Proceeding of Steklov Institute of Mathematics, v. 3, 
1978. 

Holevo, A. S. (2001). Statistical Structure of Quantum Theory (1 ed.). 
Springer- Verlag. Lecture Notes in Physics. Monographs; 67. 



14 



Nielsen, M. A. and I. L. Chuang (2000). Quantum Computation and Quantum 
Information. Cambridge University Press. 

Ogawa, T. and H. Nagaoka (2000). Strong converse and Stein's lemma in 
quantum hypothesis testing. IEEE Transactions on Information The- 
ory 46{3), 2428-2433. 

Parthasarathy, K. R. (2001). On consistency of the maximum likelihood 
method in testing multiple quantum hypotheses. In Stochastics in fi- 
nite and infinite dimensions, Trends Math., pp. 361-377. Boston, MA: 
Birkhauser Boston. 

Sanov, I. N. (1957). On the probability of large deviations of random variables. 
Matematicheskii Sbornik 42, 11-44. 

Santalo, L. A. (1976). Integral geometry and geometric probability. Addison- 
Wesley Publishing Co., Reading, Mass. -London- Amsterdam. With a fore- 
word by Mark Kac, Encyclopedia of Mathematics and its Applications, 
Vol.1. 

Turchette, Q. A., C. S. Wood, B. E. King, C. J. Myatt, D. Lcibfried, W. M. 
Itano, C. Monroe, and D. J. Wincland (1998). Deterministic entanglement 
of two trapped ions. Physical Review Letters 81(17), 3631-3634. 



15 



